Integrative random forest for gene regulatory network inference
نویسندگان
چکیده
MOTIVATION Gene regulatory network (GRN) inference based on genomic data is one of the most actively pursued computational biological problems. Because different types of biological data usually provide complementary information regarding the underlying GRN, a model that integrates big data of diverse types is expected to increase both the power and accuracy of GRN inference. Towards this goal, we propose a novel algorithm named iRafNet: integrative random forest for gene regulatory network inference. RESULTS iRafNet is a flexible, unified integrative framework that allows information from heterogeneous data, such as protein-protein interactions, transcription factor (TF)-DNA-binding, gene knock-down, to be jointly considered for GRN inference. Using test data from the DREAM4 and DREAM5 challenges, we demonstrate that iRafNet outperforms the original random forest based network inference algorithm (GENIE3), and is highly comparable to the community learning approach. We apply iRafNet to construct GRN in Saccharomyces cerevisiae and demonstrate that it improves the performance in predicting TF-target gene regulations and provides additional functional insights to the predicted gene regulations. AVAILABILITY AND IMPLEMENTATION The R code of iRafNet implementation and a tutorial are available at: http://research.mssm.edu/tulab/software/irafnet.html
منابع مشابه
Performance and Improvement of Tree-Based Methods for Gene Regulatory Network Reconstruction
Computational reconstruction of gene regulatory networks (GRNs) from gene expression data is of great importance in systems biology. Dialogue for Reverse Engineering Assessments and Methods (DREAM) challenge aims to evaluate the success of computational GRN inference algorithm on benchmarks of simulated data. Tree-based methods, such as Random Forest, infer true regulators of a target gene in a...
متن کاملFrom genetics to gene networks: Evaluating approaches for integrative analysis of genetic marker and gene expression data for the purpose of gene network inference
متن کامل
A Learning Framework to Improve Unsupervised Gene Network Inference
Network inference through link prediction is an important data mining problem that finds many applications in computational social science and biomedicine. For example, by predicting links, i.e., regulatory relationships, between genes to infer gene regulatory networks (GRNs), computational biologists gain a better understanding of the functional elements and regulatory circuits in cells. Unsup...
متن کاملPredictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks.
Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning f...
متن کاملSupplementary Information Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks
S1 Fraction of integrative network edges with and without physical support . . . . . 4 S2 Degree distributions of integrative networks . . . . . . . . . . . . . . . . . . . . . 5 S3 Structural properties of integrative networks . . . . . . . . . . . . . . . . . . . . . 6 S4 Network motifs of integrative networks . . . . . . . . . . . . . . . . . . . . . . . . 7 S5 Prediction of novel functional...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 31 شماره
صفحات -
تاریخ انتشار 2015